MDL Based Fitness Functions for Learning Parsimonious Programs1

نویسنده

  • Byoung-Tak Zhang
چکیده

Genetic programming starts with an initial population of computer programs composed of elementary functions and termir_als ~9, 10, 8]. Genetic operators, such as crossover and selection, are used to adapt the shape and size of the programs and evolve increasingly fit populations. This process can be viewed as a search for a highly fit computer program, Abe•t, in the whole program space A= {A1, A2, ... }. The quality of each computer program is measured by running it over a training set lJ of input-output cases of the unknown process !: D = {(xc, Yc)}~1 , where Xc EX, Yc E Y andyc = }(xc). The domain X and the range Yare defined by the application. The goodness of the program, A, is usually measured in terms of the error: 2:~1 (YcfA(xc)) 2 , where fA is the function realized by A. Though this training accuracy can be used as a single measure for fitness, many empirical studies have shown that, as programs grow, it also become less and less likely for them to be general [7, 15]. In addition, large structures require more computer resources in space and time for the evolution. Several methods have been proposed to take into consideration of structural complexity in fitness evaluation, but relatively few attempts have been made in the genetic programming community to employ the complexity penalty in a more principled way. In this paper we use a Bayesian model-comparison method to develop a framework in which a class of fitness measures is introduced for dealing with problems of parsimony based on the

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A two phase approach to Bayesian network model selection and comparison between the MDL and DGM scoring heuristics

This paper presents an eficient algorithm for learning a Bayesian belief network (BBN) structure from a database, as well as providing a comparison between two BBN structure fitness functions. A Bayesian belief network is a directed acyclic graph representing conditional expectations. In this paper, we propose a two-phase algorithm. The first phase uses asymptotically correct structure learning...

متن کامل

Fitness Function Comparison for GA-Based Feature Construction

When primitive data representation yields attribute interactions, learning requires feature construction. MFE2/GA, a GA-based feature construction has been shown to learn more accurately than others when there exist several complex attribute interactions. A new fitness function, based on the principle of Minimum Description Length (MDL), is proposed and implemented as part of the MFE3/GA system...

متن کامل

The role of fitness and aerobic exercises on the executive functions of dyslexia and dyscalculia children

Abstract Background and Purpose: Studies carried out in recent decades show that exercises have useful impact on improving executive functions. Therefore, this study aims to investigate the roles of fitness and aerobic exercises on problem-solving process of dyslexic and dyscalculia children. Materials and methods: Using systematic review study and key terms including fitness and aerobic e...

متن کامل

Complexity-based Fitness Evaluation for Variable Length Representation

We have chosen MDL-based approach to GA and GP, mainly for the purpose of improving the robustness of learning. Its applicability has been studied in various problem domains, e.g., generating a decision tree, solving a system identiication problem, and evolving a hardware. This paper summarizes our recent research results on the complexity-based tness evaluation for the variable length represen...

متن کامل

Non-uniform Kernel Allocation Based Parsimonious HMM

In conventional Gaussian mixture based Hidden Markov Model (HMM), all states are usually modeled with a uniform, fixed number of Gaussian kernels. In this paper, we propose to allocate kernels nonuniformly to construct a more parsimonious HMM. Different number of Gaussian kernels are allocated to states in a non-uniform and parsimonious way so as to optimize the Minimum Description Length (MDL)...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012